Natural Language Processing Scientific Literature DEMONSTRATIVE ANAPHORA: FORMS AND FUNCTIONS IN FULL-TEXT SCIENTIFIC ARTICLES

نویسندگان

  • Emily G. Brassell
  • Emily Brassell
چکیده

This study examines the functions and characteristics of demonstrative anaphora (this, these, that, those) in a collection of full-text scientific documents, confirming that they play an important role in maintaining discourse focus and binding together cohesive sections of text. Unlike corpora in other subject domains, the Cystic Fibrosis database contains more demonstrative expressions than any other class of anaphora. As participants in intersentential reference, demonstratives often refer to complex propositions rather than simple noun phrases. While this tendency complicates automated resolution, our results yield some suggestions toward a resolution algorithm. Primarily, we argue for the incorporation of demonstrative form since different types of demonstratives show different patterns regarding antecedent length and composition. Although further analysis is necessary, our findings provide a groundwork for future exploration.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the generation and interpretation of demonstrative expressions

This paper presents necessary and sufficient conditions for the use of demonstrative expressions in English and discusses implications for current discourse processing algorithms. We examine a broad range of texts to show how the distribution of demonstrative forms and functions is genre dependent. This research is part of a larger study of anaphoric expressions, the results of which will be in...

متن کامل

Characterizing In-Text Citations Using N-Gram Distributions

Introduction This article focuses on a Natural Language Processing (NLP) approach for the analysis of citation functions in scientific papers. Bibliometric studies traditionally rely on citation metadata and count the number of times a publication has been cited. However, some recent studies rely also on full text processing on papers, e.g. (Boyack et al., 2013), (Bertin et al., 2013, 2014). Th...

متن کامل

Systematic Parameterized Description of Pro-forms in the Prague Dependency Treebank 2.0∗

A pro-form is a word that is used to replace or substitute other words, phrases, clauses, or sentences etc. Besides pronouns one can also distinguish pro-adjectives, pro-numerals, pro-adverbs, and pro-verbs.1 Pro-forms are related to a wide range of linguistic phenomena, from wordformative principles, through negation and quantification, to anaphoric and deictic functions. As it was recognized ...

متن کامل

Assessing the Quality of Persian Translation of the Book “Principles of Marketing” Based on the House’s (TQA) Model

Translation is evaluated in terms of its forms and functions inside the historically developed systems of the receiving culture and literature. This study aimed to evaluate the quality of Persian translation of the14th edition of the original English book “Principles of Marketing” written by Philip Kotler and Gary Armstrong based on House (TQA) model: overt and covert translation distinction. T...

متن کامل

A Corpus-Based Study of Demonstratives in German, Russian and English

The current article presents results from three quantitative corpus studies on the use of demonstrative expressions (demonstrative NPs, demonstrative pronouns) in German, English and Russian. It focuses on two prevalent hypotheses: 1) demonstratives correspond to the medium activation level; 2) demonstratives establish discourse topics. As for (1), it has been repeatedly claimed that referentia...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000